The ISME Journal — Latest Matching Preprints

1

Bioimaging And Comparative Genomics Uncover Persistence-Associated Bacteria In A Blood Bank Environment

D Arpino, M. C.; Alonso-Reyes, D.; Grillo-Puertas, M.; Galvan, F. S.; Alvarado, N. N.; Martinez, L. J.; Marranzino, M. G.; Albarracin, V. H.

2026-07-21 health systems and quality improvement 10.64898/2026.07.19.26357333 medRxiv

Top 2%

2.7%

Show abstract

Blood banks represent highly controlled healthcare environments where microbiological surveillance has traditionally focused on blood products rather than environmental microbial reservoirs. Despite their critical role in transfusion safety, the ecology of surface-associated microorganisms and the persistence traits that enable their long-term survival remain poorly understood. Here, we combined scanning electron microscopy, culture-based microbiology, phenotypic characterization, MALDI-TOF mass spectrometry, and whole-genome sequencing to investigate whether surfaces within a public blood bank facility constitute reservoirs of environmentally derived bacteria with enhanced persistence potential. Samples collected from a public blood bank in Tucuman, Argentina yielded 37 culturable bacterial isolates, predominantly Gram-positive environmental taxa together with a limited number of opportunistic Gram-negative species. More than 30% of the isolates exhibited multidrug resistance, while several strains displayed strong biofilm formation, amyloid-like fiber production, motility, and hemolytic activity, indicating multiple phenotypic strategies associated with long-term surface persistence. Whole-genome sequencing of six representative isolates confirmed species identity, identified genes related to antimicrobial resistance, adhesion, biofilm formation, stress adaptation, and cytotoxicity, and revealed frequent genotype-phenotype discordance, highlighting the importance of integrating genomic and phenotypic analyses. Notably, one isolate exhibited less than 92% average nucleotide identity with publicly available genomes, suggesting the presence of a previously undescribed environmental species. Thus, blood bank surfaces function as selective ecological niches favoring bacteria with persistence-associated traits rather than simply reflecting contamination from blood products. These microorganisms may constitute latent biosafety hazards if environmental barriers fail, particularly in facilities handling biological materials intended for vulnerable patients. Our results support the incorporation of integrated bioimaging, phenotypic characterization, and genome-resolved environmental surveillance into infection prevention strategies and transfusion biosafety programs within a One Health framework.

2

How bursty infectiousness shapes epidemic dynamics

Kissler, S. M.

2026-07-17 epidemiology 10.64898/2026.07.15.26358199 medRxiv

Top 3%

1.0%

Show abstract

An epidemic's expected course is determined by the magnitude and timing of a typical person's infectiousness --- captured, in turn, by the basic reproduction number and the generation-time distribution. These fundamental, population-average quantities can mask individual-level variation that shapes how an epidemic actually unfolds: for example, individual variation in the magnitude of infectiousness (overdispersion) creates superspreading, a key feature of the SARS-CoV-1 and SARS-CoV-2 epidemics. However, the impact of individual variation in infectiousness timing is less well understood. Here, we demonstrate that individual infectiousness timing varies substantially and to different degrees across pathogens. For some common pathogens, including influenza, measles, and SARS-CoV-2, infectiousness is "bursty", or highly concentrated and variably-timed across individuals: for example, the window of appreciable infectiousness for SARS-CoV-2 may last for roughly a day, vs. the 9--12 days usually quoted. We show that bursty infectiousness creates superspreading without inherent superspreaders, makes epidemic timing more variable, amplifies the time-sensitivity of common interventions, and complicates inference of key epidemiological parameters. Together with the reproduction number, the generation-time distribution, and overdispersion, burstiness completes a family of basic parameters that govern how epidemics unfold.

3

Municipal wastewater surveillance reveals socioeconomic and immigration gradients in antimicrobial resistance across Alberta, Canada

Lee, J.; Gonzalez, C.; Au, E.; Acosta, N.; Waddell, B. J.; Xu, Z. S.; Clark, R. G.; Weyant, R. B.; Dalton, B.; Zaheer, R.; McAllister, T. A.; Barkema, H.; Nobrega, D.; Bhatnagar, S.; Lee, B. E.; Pang, X.; O'Grady, C.; Frankowski, K.; Bertazzon, S.; Conly, J. M.; Hubert, C. R. J.; Parkins, M. D.

2026-07-21 infectious diseases 10.64898/2026.07.19.26358431 medRxiv

Top 4%

0.4%

Show abstract

Antimicrobial resistance (AMR) is an ever-increasing threat to population health. Industrial, environmental and societal factors are increasingly recognized as important contributors to AMR within communities. Here, we investigated the spatial distribution of AMR genes (ARGs) across Alberta, Canada and their association with socio-economic, immigration-related, and agro-industrial characteristics using municipal wastewater-based surveillance. We analyzed monthly wastewater metagenomes collected between March 2022 and March 2023 across eleven municipalities, representing 39% of Alberta's population. Integration with census data enabled multivariate analysis, revealing that municipal resistome profiles were strongly structured along income and immigration-related population gradients. ARGs spanning 14 resistance classes exhibited distinct distributional patterns across income and immigration gradients, including contrasting associations among beta-lactam, aminoglycoside, and macrolide-lincosamide-streptogramin ARGs, consistent with heterogeneous selection pressures across sub-populations. These findings demonstrate the capacity of longitudinal wastewater surveillance to identify persistent population-level resistome patterns and highlight the importance of incorporating sociodemographic context into AMR surveillance and mitigation strategies.

4

Tumor-Colonizing Microbiota Distinguish Early- and Late-Onset Colorectal Cancer in a Hispanic/Latino Patient Cohort

Manjarrez, S.; Diaz, F. C.; Carranza, F. G.; Waldrup, B.; Ninova, M.; Velazquez-Villarreal, E.

2026-07-21 oncology 10.64898/2026.07.19.26358429 medRxiv

Top 5%

0.3%

Show abstract

Background: Early-onset colorectal cancer (EOCRC) is increasing globally, particularly among Hispanic/Latino (H/L) populations, yet the contribution of tumor-colonizing microbiota to age-associated colorectal cancer (CRC) biology remains poorly understood. Most microbiome studies have focused on fecal communities or non-Hispanic populations, leaving the intratumoral microbial landscape of H/L patients largely unexplored. Methods: We performed an exploratory characterization of tumor-colonizing microbiota using whole-exome sequencing (WES) data from four primary colorectal tumors obtained from H/L patients treated at City of Hope, including two EOCRC (<50 years) and two late-onset colorectal cancer (LOCRC; [≥]50 years) cases. Following removal of host-derived sequences, microbial taxonomic profiling was conducted at the family, genus, and species levels, and microbial metabolic pathways were inferred. Clinical and pathological data were integrated to evaluate age-associated differences in microbial composition and predicted function. Results: Family-, genus-, and species-level analyses consistently demonstrated greater microbial diversity in LOCRC than EOCRC. LOCRC contained more than twice the number of unique bacterial families, nearly three times as many unique genera, and more than twice as many unique bacterial species. A conserved core microbiota, including Fusobacteriaceae, Prevotellaceae, Fusobacterium, and Prevotella, was identified across both age groups, whereas LOCRC was enriched in CRC-associated taxa including Fusobacterium nucleatum, Bacteroides fragilis, Parvimonas micra, Porphyromonas asaccharolytica, and Dialister pneumosintes. Species-level analyses revealed only a single shared bacterial species between EOCRC and LOCRC, indicating progressive microbial divergence with increasing taxonomic resolution. In contrast, functional profiling identified 11 predicted microbial metabolic pathways, of which nine were shared between age groups, two were unique to EOCRC, and none were exclusive to LOCRC. Core metabolic pathways involved in energy metabolism, amino acid biosynthesis, phospholipid metabolism, and central carbon metabolism exhibited comparable abundance across both groups, demonstrating substantial functional conservation despite pronounced taxonomic differences. Conclusions: Tumor-colonizing microbiota differ markedly between EOCRC and LOCRC in H/L patients, with late-onset tumors exhibiting substantially greater microbial richness and taxonomic complexity. Despite these compositional differences, microbial metabolic functions remain largely conserved, supporting the concept of functional redundancy within the colorectal tumor microenvironment (TME). Although exploratory, this proof-of-concept study provides one of the first characterizations of intratumoral microbiota in H/L EOCRC and establishes a foundation for larger multi-omics investigations aimed at identifying microbiome-based biomarkers and therapeutic targets for precision oncology.

5

Intravesical Lactobacillus rhamnosus GG reduces symptoms among people with spinal cord injury and disease who use intermittent catheterization: A randomized comparison of two- and four-dose regimens.

Groah, S. L.; Tractenberg, R. E.; Riegner, C. R.; Forster, C. S.

2026-07-20 urology 10.64898/2026.07.17.26358333 medRxiv

Top 8%

0.1%

Show abstract

Background: Urinary tract infection (UTI) is the most common secondary condition among people with spinal cord injury/disease (SCI/D). Intravesical Lacticaseibacillus rhamnosus GG (LGG) is an antibiotic-sparing approach to managing urinary symptoms. Objective: Determine the optimal number of doses of intravesical LGG for urinary symptom reduction. Design: Prospective, randomized, two-arm dosing trial. Setting: National recruitment with a local subsample providing urine samples in Washington, DC, USA. Participants: Adults with SCI/D and neurogenic lower urinary tract dysfunction (NLUTD) who use intermittent catheterization (IC); 177 enrolled and randomized (intention-to-treat), with 76 compliant instillers (39 low-dose, 37 high-dose) in the per-protocol analytic sample. Interventions: Two (2 doses/24 hours) or four (4 doses/36 hours) intravesical LGG regimens, self-initiated in response to cloudier or malodorous urine per the Self-Management Protocol using Probiotics (SMP-Pro). Main Outcome Measures: Primary: proportion achieving [≥]20% reduction on the Urinary Symptom Questionnaire for Neurogenic Bladder-Intermittent Catheter version (USQNB-IC). Secondary: urinary biomarkers (leukocyte esterase, nitrite, white blood cells, urinary neutrophil gelatinase-associated lipocalin [uNGAL]) and standard urine culture (SUC) in a local subsample. Results: By Day 2, 57.9% (63.8% low-dose; 51.2% high-dose) achieved [≥]20% total symptom reduction; high-dose success rose to 70.0% by Day 4. Thirty percent of high-dose participants did not respond at either time point and could not be distinguished from responders by demographics or urine biomarkers. Urinary biomarkers and SUC were unchanged pre- to post-instillation. No serious adverse events were adjudicated as attributable to intravesical LGG by an independent Data Safety Monitoring Board (DSMB). Conclusions: A two-dose course of intravesical LGG yields clinically meaningful symptom improvement in the majority of people with SCI/D and NLUTD who use IC; four doses benefits a meaningful subgroup of two-day non-responders, while a small cohort remains nonresponsive. These results provide preliminary dosing guidance and support progression to a definitive trial.

6

Molecular and phylogenetic insights into the novel Brugia sp. in Sri Lanka with new evidence for zoonotic transmission

Nimalrathna, S. U.; Harischandra, H.; Kimber, M.; Chandrasena, N.; De Silva, N.; Mallawarachchi, H.; De Silva, B. G. D. N. K.

2026-07-21 infectious diseases 10.64898/2026.07.20.26358473 medRxiv

Top 11%

0.0%

Show abstract

The World Health Organization (WHO) validated Sri Lanka had eliminated lymphatic filariasis as a public health problem in 2016, the second country in Southeast Asia to attain this status. However, post-validation surveillance has identified sporadic cases of brugian filariasis. The reemergence of Brugia malayi infections in Sri Lanka warrants urgent investigations. Recent studies have shown that the parasite responsible for the reemergence is a novel zoonotic Brugia sp. maintained among dogs that is closely related but distinct to the human-infecting B. malayi species. The current study employed morphological and morphometric assessments, revealing that this novel zoonotic Brugia sp. is within the B. malayi morphological range. Molecular characterization of three genomic regions, the nuclear genomic region SLXI, the non-coding region HhaI, and the mitochondrial genomic region COXI confirmed it as a genetic variant more closely related to B. malayi than to B. pahangi. Phylogenetic analysis further indicated it as a distinct genomic variant, closely related to a B. malayi-like parasite reported from India. Notably, that same parasite was identified in infected humans, animals, and potential vector mosquitoes. This, together with the detection of both human and animal blood within the same brugian infective mosquitoes, and delineating the canine origin of the parasites in human infections, provides compelling evidence supporting zoonotic transmission of this parasite. To our knowledge, this is the first report demonstrating the presence of the same brugian parasite in humans, domestic animals, and potentially infective mosquitoes in Sri Lanka, supported by multi-genomic evidence. The recent identification of multiple potential mosquito vector species suggests that this parasite may have undergone adaptive changes, facilitating its ability to overcome the species barrier. These findings substantiate the long-held hypothesis of zoonotic transmission of the reemerged brugian parasite, highlighting significant implications for ongoing surveillance and control strategies.

7

Comparing Human and Large Language Model Responses to Patients Online Questions: Towards Multi-dimensional Patient-centered Support

Hussein, M. A.; Doshi, R.; He, L.; Reynolds, T.

2026-07-17 health informatics 10.64898/2026.07.15.26355314 medRxiv

Top 11%

0.0%

Show abstract

Patients and caregivers seek informational and emotional support throughout medical care, especially when interpreting unfamiliar laboratory test results. Although resources such as patient portals and online health communities (OHCs) help address questions, gaps remain. The emergence of large language models (LLMs) offers the potential to be a complementary source of support to assist patients and caregivers in understanding and using their test results. The objective of our study is to empirically compare LLM responses to patients online questions containing their laboratory test results to responses written by peers in an OHC. We compared the 519 peer replies to 122 laboratory test-related posts from an OHC to 488 responses generated from four LLMs using mixed computational and qualitative methods. LLMs frequently provided clear explanations of medical terminology and structured interpretations of numeric results but were longer and less readable. Peers offered more personalized, context-specific emotional support. Overall, LLMs have the potential to complement peer responses in OHCs, but require greater emotional depth, reasoning transparency, and alignment with community norms.

8

FootNet: A Multi-View Smartphone Dataset and Four-Model Benchmark for Clinical Foot Segmentation

Vijay, A.; Prabhune, A.; Srihari, V. R.; Rayampalli, A.

2026-07-17 health informatics 10.64898/2026.07.15.26358117 medRxiv

Top 11%

0.0%

Show abstract

We present FootNet, a 453-image multi-view smartphone foot dataset for binary foot segmentation, with expertannotated masks across six anatomical views (dorsal, medial, and plantar, both left and right). We benchmark four segmentation models under a controlled protocol: U-Net with a MobileNetV2 encoder achieves the best performance (IoU 0.9268, Dice 0.9608, 95 % CI [0.9209, 0.9320]); DeepLabV3 with MobileNetV3-Large scores IoU 0.8984 (Dice 0.9449); UNet++ with MobileNetV2 scores IoU 0.8913 (Dice 0.9391); and SAM ViT-B with oracle boundingbox prompt scores IoU 0.9219 on the matched 191-image subset. Bonferroni-corrected Wilcoxon signed-rank tests (k = 6 comparisons) show U-Net significantly outperforms DeepLab (p < 0.001, r = 0.638) and SAM ViT-B with oracle boundingbox (p = 0.005, r = 0.202); UNet++ does not significantly differ from DeepLab (p = 0.062). Connected-component postprocessing yields negligible benefit (mean {triangleup}IoU = +0.0003, 12 of 453 images improved). The extended dataset is available upon request

9

Multilevel Factors Associated with Nonresponse to Patient-Reported Outcome Measures in Routine Radiation Oncology Care

Liu, J. B.; Chen, Y.-J.; Edelen, M. O.; Pusic, A. L.; Martin, N. E.; Zeng, C.

2026-07-17 health systems and quality improvement 10.64898/2026.07.15.26358162 medRxiv

Top 11%

0.0%

Show abstract

Purpose: Nonresponse to routinely collected patient-reported outcome measures (PROMs) threatens the representativeness of aggregated data. We characterized patient-, provider-, and clinic-level factors associated with PROMIS Global-10 nonresponse in routine radiation oncology care. Methods: In this retrospective cohort study, all adults seen at five Mass General Brigham radiation oncology clinics over one year were included. The primary outcome was patient-level nonresponse, defined as never completing the portal-administered Global-10 versus completing it at least once. Using iterative mixed-effects logistic regression, we modeled patient-, provider-, and clinic-level factors. Results: Among 12,214 patients, 71 providers, and five clinics, patient- and appointment-level response rates were 35.4% and 10.9%, with patient-level response ranging nearly fivefold across clinics (12.8% to 66.2%). In Model 1, male sex, lower education, not working, and recent surgery had higher odds of nonresponse, and longer time since diagnosis lower odds. After provider- and clinic-level factors were added, patient sex, education, and employment became nonsignificant, whereas recent surgery (adjusted odds ratio [aOR] 1.97) and longer time since diagnosis (aOR 0.46 for >12 months) persisted. A provider's historical collection rate was protective but attenuated at the clinic level. There, a later program launch (aOR 0.29) and higher historical collection rate (aOR 0.79) correlated with lower nonresponse, whereas academic versus community setting did not. Conclusions: Nonresponse to routinely collected PROMs is a multilevel phenomenon driven substantially by clinic-level implementation factors, not patient characteristics alone. Because response rate is only a proxy for representativeness, PROMs programs and PRO-based performance measures should prioritize representative collection over volume.

10

Rationale and guidance for implementing the continual reassessment method for dose-finding in controlled human infection model studies

Weerasinghe, C.; Osowicki, J.; Simpson, J. A.; Crocker-Buque, T.; McCarthy, J.; Williams, E.; Price, D. J.

2026-07-17 infectious diseases 10.64898/2026.07.16.26358128 medRxiv

Top 11%

0.0%

Show abstract

Controlled human infection models (CHIMs) are increasingly used in infectious disease research to study pathogen dynamics and evaluate interventions under controlled conditions. However, these studies are resource-intensive and involve ethical and safety constraints, making efficient study design critical. Dose-finding is a key early component in CHIMs, where the aim is to identify a challenge dose that achieves a target infection probability. Traditional rule-based designs are commonly used but can be inefficient, motivating the use of model-based adaptive approaches such as the Bayesian Continual Reassessment Method (CRM). Although CRM has been extensively studied and widely adopted in Phase I oncology trials for identifying the maximum tolerated dose of therapeutics, its application in CHIM settings remains limited, particularly when the endpoint of interest is infection. This tutorial provides step-by-step guidance for implementing a Bayesian CRM in dose-finding CHIMs, using an oropharyngeal Neisseria gonorrhoeae challenge as a motivating case study. The framework outlines key design components, including dose-grid specification, dose-response model, prior elicitation, Bayesian updating, decision rules, and stopping criteria, with particular emphasis on a clinically interpretable parameterisation. Trial operating characteristics are evaluated through simulation studies under multiple dose-response scenarios and prior-predictive analyses, and compared with a commonly used '3+3' type rule-based design. This work highlights the advantages of Bayesian model-based designs for dose-finding in CHIMs over classic rule-based designs and provides a structured, reproducible framework for implementing CRM, supporting their application in future CHIM studies.

11

Comparative Efficacy of Vancomycin and Fidaxomicin Regimens for the Prevention of Recurrent Clostridioides difficile Infection: A Systematic Review and Network Meta-Analysis of Randomized Controlled Trials

Prosty, C.; Butler-Laporte, G.; Brophy, J.; Frenette, C.; Loo, V.; Coburn, B.; Hota, S.; Longtin, Y.; Kong, L.; Muller, M.; Steiner, T.; Valiquette, L.; Daneman, N.; Daley, P.; Nott, C.; MacFadden, D. R.; Kandel, C.; Chen, Y.; Perez- Patrigeon, S.; Lee, T. C.; McDonald, E.

2026-07-17 infectious diseases 10.64898/2026.07.14.26358112 medRxiv

Top 11%

0.0%

Show abstract

Background and Aims The optimal treatment for first episodes and first recurrences of Clostridioides difficile infections (CDI) is unknown and there is emerging evidence for pulse and taper (P-T) regimens. Therefore, we sought to estimate the relative efficacy of treatment options. Methods MEDLINE and CENTRAL were searched from database inception to May 21, 2025 and unpublished conference abstracts were searched from recent infectious disease conferences. RCTs on the treatment of first episodes or first recurrences of CDI comparing fixed-dose or P-T regimens of fidaxomicin or vancomycin were included. The primary and secondary outcomes were 40- and 56-day CDI recurrence, respectively. A random-effects network meta-analysis on the risk ratio (RR) scale was conducted using a standard regimen (10-14 days) of vancomycin as the comparator. Treatments were ranked using the surface under the cumulative ranking curve (SUCRA). Results 8 RCTs were included comprising a total of 2181 patients. For 40-day recurrence, fidaxomicin P-T had the highest probability of ranking best (RR=0.10, 95%Confidence Interval [95%CI]=0.10-0.49, SUCRA=1.00), followed by vancomycin P-T (RR=0.49, 95%CI=0.32-0.76, SUCRA=0.61), fixed-dose fidaxomicin (RR=0.61, 95%CI=0.49-0.76, SUCRA=0.39), and, finally, fixed-dose of vancomycin (SUCRA=0.00). The treatments ranked in the same order for 56-day recurrence, though only 3 RCTs reported on this timepoint. Conclusion Vancomycin P-T, fidaxomicin P-T, and fixed-dose fidaxomicin were all superior to a fixed-dose vancomycin. Head-to-head comparative effectiveness RCTs are needed to quantify their relative effect sizes of and impact on long-term prevention of recurrent CDI.

12

Nationwide Mpox Genomic Surveillance Reveals Clade Ib Introductions, APOBEC3-Driven Evolution, and Terminal Deletions

Brochu, H. N.; Shi, Q.; Song, K.; Zhang, Q.; Munroe, J.; Harris, N. J.; Britt, N.; Zeng, Q.; Kapuria, K.; Chappell, J.; Norvell, B. M.; Peavy, L.; Williams, J. D.; Harris, A. B.; Chaitram, J.; Hutson, C. L.; Deng, J.; McGrath, D.; Boles, D.; Dale, S. E.; Gigante, C. M.; Iyer, L. K.

2026-07-17 infectious diseases 10.64898/2026.07.15.26357894 medRxiv

Top 11%

0.0%

Show abstract

Background The 2022-2023 global mpox outbreak highlighted the critical need for robust genomic surveillance capabilities to track mpox virus (MPXV) evolution and transmission dynamics. Methods Building upon our established SARS-CoV-2 sequencing infrastructure, we implemented a Molecular Loop probe-based long-read sequencing approach using Pacific Biosciences Sequel II technology for comprehensive MPXV genomic surveillance across the United States (US). From August 2024 to June 2025, we generated 326 high-quality whole genome sequences from residual mpox-positive clinical specimens collected by Labcorp across all 10 US Department of Health and Human Services regions. Results Our analysis identified two samples containing clade Ib MPXV in January and June 2025 and captured shifting trends in clade IIb diversity, with 13 distinct lineages observed. We also identified multiple instances of large (~1.6-17.6kb) deletions proximal to the inverted terminal repeats in clade IIb genomes. APOBEC3 mutation analysis indicated substantial evidence of human-to-human transmission among both clades. Further, we observed significantly higher APOBEC3-associated SNPs per kilobase (P<0.001) in clade IIb genomic variable regions relative to their central conserved region. Our assay exhibited strong reproducibility across biological replicates from individual patients and accuracy was confirmed via parallel sequencing of select specimens by US Centers for Disease Control and Prevention (CDC) using metagenomic sequencing. We also demonstrated via custom simulation that our assay discriminates all known MPXV clades and lineages, including those we have not observed in the US. Conclusions Our integrated nationwide surveillance system facilitates real-time genomic tracking of outbreak evolution, with demonstrated capacity across SARS-CoV-2 and MPXV, positioning this platform for rapid deployment during future pathogen emergence.

13

Genome-Wide Association Studies and Deep-Learning Functional Annotation of Opioid Use Disorder across Three Ancestries in the All of Us Research Program

Gu, S.; Petrovitch, D.; Hall, O. T.; Lambert, J. W.; Kember, R. L.; Nahid, N. A.; Ma, Q.; Sprague, J. E.; McDonough, C. W.; Johnson, J. A.

2026-07-17 addiction medicine 10.64898/2026.07.15.26358096 medRxiv

Top 11%

0.0%

Show abstract

Background: Opioid use disorder (OUD) is heritable, yet most genome-wide association studies (GWAS) have focused on European populations, leaving the genetic architecture of OUD in non-European populations underexplored. Methods: We conducted GWAS of OUD across three ancestries using electronic health records and genomic data from 52,357 All of Us Research Program participants (8,912 cases; 43,445 matched opioid-exposed controls; 48.5% female). Participants were stratified into European (EUR), African (AFR), and Admixed American (AMR) ancestry groups for logistic regression GWAS, with independent replication in the Million Veteran Program. We then applied the deep-learning model AlphaGenome to predict the tissue-specific transcriptomic and splicing consequences of top risk variants across 13 reward-pathway brain regions. Results: We identified and replicated a novel DDX6 risk locus, alongside established OPRM1 and FURIN signals. AlphaGenome predicted the DDX6 regulatory allele downregulates the stress-resistance gene FOXR1 in the nucleus accumbens, while the protective OPRM1 variant (rs1799971) upregulates OPRM1 expression across reward networks. Other signals of interest included IL6R and SHISA9 (EUR); GHR (AFR); and ASTN2 (AMR). Conclusions: This study identifies DDX6 as a novel OUD risk locus, replicates associations with OPRM1 and FURIN, and highlights biologically plausible ancestry-specific signals in AFR and AMR populations. We also replicated top variants in an independent population. Finally, integrating GWAS with deep-learning annotations provides specific, localized biological hypotheses to guide future experimental validation and targeted therapeutics.

14

Complex intra-host SARS-CoV-2 evolution following monoclonal antibody pre-exposure prophylaxis

Kamelian, K.; Pascall, D. J.; Cheng, M. T. K.; Meng, B.; Altaf, M.; Morse, R. M.; Aggio, J. B.; Egan, D. J. S.; Chen-Xu, M.; Trivioli, G.; Sutton, B.; Richter, A.; Gonzalez-Vazquez, L. D.; Cormie, C.; Kemp, S.; Yeadon, R.; Hyatt, B.; Wong, A.; Thesin Pelamkulangara, N.; Fraser, E.; McCarthy, B.; Novaes, F.; Stott, S.; Galvin, A.; Bellis, K. L.; De Angelis, D.; Harrison, E. M.; Martin, D.; Smith, R. M.; Gupta, R. K.

2026-07-17 infectious diseases 10.64898/2026.07.14.26356329 medRxiv

Top 11%

0.0%

Show abstract

Background: Monoclonal antibodies have emerged as a prophylactic strategy to prevent symptomatic SARS-CoV-2 infection in immunocompromised individuals. However, the evolutionary and clinical implications of breakthrough infections under this regime remain unclear. Methods: A male in their 80s with a haematological/oncological diagnosis received a 2000 mg intravenous infusion of sotrovimab in March 2023 and was diagnosed with COVID-19 by RT-qPCR from a nasopharyngeal swab in August 2023. Weekly samples (n=24) were collected through February 2024 (171 days). All samples underwent whole-genome sequencing, with select mutations subjected to functional assessment. Findings: Sequencing identified the GE.1 lineage at all timepoints. An intra-host recombination event in ORF1ab (positions 8942-12458) was detected prior to 23 weeks post-detection, followed by a 14-fold increase in viral load (7.42e+06 to 1.00e+08 RNA copies/mL) and a marked shift in the viral population. E340D, a sotrovimab resistance mutation, was detected at low abundance (46%) within the first week post-infection, fluctuated over time, and was nearly fixed by week 15 (107 days) post-detection. We assessed five spike mutations - V36M, S98F, and V213G in the N-terminal domain, Y505P in the receptor-binding domain, and P681Q near the S1/S2 cleavage site - and additionally evaluated the impact of E340D. V36M conferred the highest infectivity across all cell lines, with the most significant effect in low-TMPRSS2 cells. While all mutations showed enhanced infectivity with the addition of E340D, the effect was most pronounced in mutations with lower baseline infectivity. The addition of E340D significantly decreased relative neutralizing titres for V36M, S98F, and V213G, enabling escape from neutralizing antibodies in XBB-responsive individuals, illustrating an enhanced phenotypic advantage. Patient neutralizing activity was absent pre-sotrovimab, and sotrovimab-induced neutralization was further compromised by selection of E340D. Interpretation: Sotrovimab pre-exposure prophylaxis in an immunocompromised patient did not prevent SARS-CoV-2 infection, and selected for resistant mutation E340D, with unexpected fitness consequences across non-receptor binding domain spike regions.

15

Elevated BrainAGE precedes cognitive impairment and improves prediction of future cognitive decline

Moradi, E.; Dahnke, R.; Gaser, C.; Rikkonen, T.; Kroger, H.; Vaananen, S.; Solomon, A.; Sund, R.; Tohka, J.

2026-07-17 health informatics 10.64898/2026.07.15.26358150 medRxiv

Top 11%

0.0%

Show abstract

Magnetic Resonance Imaging (MRI) derived brain age varies substantially between individuals, but it remains unclear whether early deviations from normal brain ageing precede future cognitive decline and whether they provide predictive value beyond conventional MRI measures. Here, we investigated whether MRI-derived brain age gap estimation (BrainAGE) identifies early structural brain ageing differences among cognitively normal individuals who later develop mild cognitive impairment (MCI) or dementia. We analysed longitudinal structural MRI data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and replicated the main findings in the population-based Kuopio Osteoporosis Risk Factor and Prevention Study (OSTPRE). Individuals who later converted to MCI or dementia had higher BrainAGE values several years before diagnosis and, in ADNI, showed steeper longitudinal increases than stable individuals. Elevated BrainAGE values were also associated with increased risk of future conversion to MCI in cognitively healthy individuals and faster subsequent memory decline. Cross-sectional differences and the association between BrainAGE and risk of future conversion were replicated in OSTPRE. Importantly, adding BrainAGE to models including demographic, APOE4, cognitive, and MRI-derived measures consistently improved prediction of future cognitive outcomes, with the greatest benefit observed for individuals who converted after longer follow-up. These findings show that structural brain ageing begins to diverge years before the onset of MCI. BrainAGE captures this early divergence, providing complementary information beyond conventional structural MRI measures that may improve the early identification of cognitively normal individuals at increased risk of future cognitive decline when integrated with other biomarkers.

16

Efficient stochastic epidemic simulation via the Sellke construction

van Boven, M.; Bootsma, M. C.

2026-07-17 epidemiology 10.64898/2026.07.16.26358219 medRxiv

Top 11%

0.0%

Show abstract

Stochastic epidemic models are a cornerstone of infectious disease epidemiology and are often used to study intervention scenarios. However, large run-to-run variability can make intervention effects difficult to estimate precisely. We revisit the epidemic Sellke construction, which assigns each individual an infection threshold for the cumulative infection hazard such that, conditional on the thresholds, the epidemic trajectory becomes deterministic. This enables coupling of simulations with and without an intervention, yielding low-variance effect estimates even when outcomes such as final size or peak incidence vary widely between runs. We develop an exact, event-driven implementation that maintains infection and recovery events in priority queues. Cumulative infection-hazard updates require O(log N) time per event, yielding overall complexity O(Elog N) for E events in a population of size N. The implementation achieves computational performance comparable to the classical Gillespie algorithm while naturally accommodating non-Markovian infectious periods and complex infectiousness profiles. We illustrate the approach using distance-dependent spread of avian influenza between poultry farms in the Netherlands and a multilayer population with households, schools, and workplaces. In both examples, coupling enables efficient within-run comparisons of intervention scenarios across stochastic realisations.

17

Bridging surveillance gaps in dengue: a hierarchical model integrating mixed data sources for transmission estimation and vaccine targeting

Djaafara, B. A.; Elyazar, I. R.; Yosephine, P.; Surya, A.; Silalahi, F. S.; Handito, A.; Thohir, B.; Aryani, D.; Gunawan, D.; Nisa, A. K.; Prianto, E.; Samad, I.; Cook, A. R.; Huang, A. T.; Clapham, H. E.; Bhatt, S.; Mishra, S.

2026-07-17 epidemiology 10.64898/2026.07.15.26358208 medRxiv

Top 11%

0.0%

Show abstract

Estimating dengue force of infection (FOI) is essential for understanding transmission dynamics and targeting intervention programmes, yet surveillance data in endemic settings required for estimations are often incomplete, with varying formats. We developed a Bayesian hierarchical catalytic model that jointly fits age-stratified case data, aggregate case data, and seroprevalence surveys within a single framework, incorporating external covariates to improve parameter identifiability. Synthetic validation showed that covariates alone recovered accurate FOI point estimates even when most districts contributed only aggregate data, but did so with poorly calibrated uncertainty; anchoring the model with a single seroprevalence survey was necessary to bring credible interval coverage close to nominal. Applied to 128 districts across Java and Bali, Indonesia (2016-2024), the model revealed substantial spatial heterogeneity in FOI and reporting rates. Many districts in Java exceeded the WHO-suggested seroprevalence threshold for vaccine introduction, yet were classified as low-priority when using reported incidence as prioritisation criterion, particularly in areas with weak surveillance. Model-based seroprevalence estimation, integrating multiple data sources, offers a more consistent basis for identifying high-priority districts for vaccine introduction, and is less susceptible to surveillance bias than reported incidence.

18

Neonatal admission as a marker of risk for poor educational attainment and special educational needs in children aged 5-11 years

John, A.; Pike, C.; Olga, L.; Sovio, U.; Wong, H. S.; Smith, G. C.; Aiken, C.

2026-07-17 pediatrics 10.64898/2026.07.15.26358132 medRxiv

Top 11%

0.0%

Show abstract

Background: Children born prematurely (before 37 weeks) or admitted to the neonatal unit (NNU) are at increased risk of adverse long-term physical health outcomes. It is also recognised that there is an association with later academic performance and special educational needs, however it is not clear whether these broad risk factors could be used as stand-alone heuristics to identify children who may benefit from additional support in educational settings. We aimed to examine the associations between neonatal unit (NNU) admission and educational attainment in mid-childhood. Methods and Findings: Pregnancy data from a prospective birth cohort (Pregnancy Outcome Prediction Study, Cambridge, United Kingdom, 2008-2012) were linked to national educational outcomes (Department for Education, United Kingdom). Multivariable regression models adjusted for maternal, child, and socioeconomic factors were used to evaluate associations between (i) all NNU admissions, (ii) at term NNU admissions >48 hours, (iii) preterm birth without ongoing physical health needs, and educational outcomes at ages 5-11 years. Children who required any NNU care were more likely not to meet expected educational standards across multiple ages and domains in early and mid-childhood: age 5 early year foundation (aOR 1.64, 95% CI 1.19-2.27, p=0.003), phonics at age 6 (aOR 2.43, 95% CI 1.72-3.57, p<0.001), and at age 7 (here assessments were divided into multiple domains): reading (aOR 1.67, 95% CI 1.18-2.38, p=0.004), writing (aOR 1.72, 95% CI 1.25-2.38, p<0.001), mathematics (aOR 1.56, 95% CI 1.09-2.22, p=0.020), and science (aOR 1.85, 95% CI 1.22-2.78, p=0.003). Similar patterns were observed among both at term-born infants who stayed >48hrs in NNU (phonics assessment at age 6 aOR 2.26, 95% CI 1.51-3.36, p<0.001) and in children born preterm without long-term physical health sequelae (phonics assessment at age 6 aOR 3.07, 95% CI 1.96-4.81, p<0.001). These associations were robust to adjustment for demographic, perinatal, and socio-economic factors. By age 11, differences in academic attainment were attenuated and no longer clearly distinguishable across all exposure groups. However, there was an increased likelihood of special educational needs (SEN) at age 11 associated with any NNU admission (aOR 1.78, 95% CI 1.15-2.73, p=0.009), at term NNU admission for >48hrs (aOR 1.88, 95% CI 1.19-3.00, p=0.007), and children born preterm without long-term physical health sequelae (aOR 1.50, 95% CI 1.00-2.25, p=0.049). Predictive performance of any NNU admission for SEN at age 11 was moderate (AUC 0.70, 95% CI: 1.14-2.65, p=0.010), with balanced sensitivity and specificity and high negative predictive value. Conclusions: NNU admission, for both term and preterm infants, is associated with poorer educational outcomes and an increased likelihood of special educational needs in mid-childhood.

19

General Practice Perspectives on Post-Infection Conditions: Scoping Review and UK Survey

Aung, K. W.; Scuffell, J.; Podlasek, A.; Engamba, S.; Jones, F.; Edwards, A.; Chew-Graham, C. A.; Sanyaolu, L.; Busse-Morris, M.

2026-07-17 primary care research 10.64898/2026.07.15.26358157 medRxiv

Top 11%

0.0%

Show abstract

Background Post-infection conditions (PICs), such as Long Covid, are associated with heterogeneous, fluctuating symptoms that profoundly affect daily functioning. Despite moderate-certainty evidence from the NIHR-funded LISTEN trial (COV-LT2-0009) that personalised self management support improves outcomes and may reduce societal and economic impacts of Long Covid, many people living with PICs still receive condition-specific services, generic advice, or stand-alone digital tools that do not address their complex needs. Aim To map care approaches in general practice and synthesise UK evidence for PIC management. Design and setting Scoping review and online survey. Method A two-phase study was conducted: (1) a scoping review of UK evidence on PIC management in general practice; and (2) a supplementary online survey of practitioners working in UK general practice to provide contextual insights. Results The scoping review identified 32 studies focused on Long Covid. One study included a comparator group (ME/CFS). Study populations were predominantly white ethnicity and female. Evidence for non-Covid PICs in UK general practice was largely absent. The supplementary survey (n=46) provided preliminary practice-level insights. Healthcare practitioners reported varied PIC presentations, diagnostic uncertainty, limited referral pathways, inequitable access, and low confidence in managing PICs. Conclusion Evidence informing PIC management in UK general practice remains predominantly Long Covid-focused and may not reflect the range of PICs encountered in practice. While survey findings are preliminary and require confirmation in larger samples, they highlight uncertainty around PIC management. Further research is needed to evaluate whether existing Long Covid pathways should be expanded or complemented by broader PIC models. Keywords general practice; Long Covid; self-management; post-viral syndromes

20

Temporal relationships between distress and pain in people living with HIV

Arendse, G.; Kamerman, P.; Wadley, A.; Edwards, R. R.; Joska, J.; Parker, R.; Madden, V. J.

2026-07-17 primary care research 10.64898/2026.07.15.26358133 medRxiv

Top 11%

0.0%

Show abstract

Objective: There is a bidirectional relationship between emotional distress and pain. However, this relationship is understudied in people with HIV in low-resource settings. This study sought to describe the temporal relationship between emotional distress and pain in people with HIV. Design: Longitudinal observational study. Methods: Participants with virally suppressed HIV, reporting either no pain or persistent pain at baseline, provided weekly remote ratings of distress, worst pain, and average pain using 0-10 visual analogue scales. Within-individual fluctuations in distress and pain were visualised over time. Group-level correlations were determined using Spearman's correlation tests. Cumulative link mixed models assessed whether distress and pain each predicted the other in the following week. Results: 72 participants provided responses over 49 weeks. The participants had a median (IQR) age of 43 (37-51) years, 63% (n=45) were unemployed and most were females (n=51;71%). Distress and pain fluctuated concurrently within individuals: distress was positively correlated with worst pain ({rho}=0.66, 95% CI= 0.60-0.72, p<0.001) and average pain ({rho}=0.70, 95% CI=0.64-0.75, p<0.001) intensity within the same week. Worst pain (OR=1.42, 95% CI=1.17-1.71, p<0.001) and average pain (OR=1.43, 95% CI=1.20-1.71, p<0.001) intensity both predicted distress in the next week. Distress predicted worst pain intensity (OR=1.25, 95% CI=1.07-1.46, p=0.023) but not average pain intensity (OR=1.19, 95% CI=1.01-1.40, p=0.152) in the next week. Conclusions: The temporal relationship between distress and worst pain intensity was bidirectional, whereas distress did not temporally predict average pain intensity. Both pain and emotional distress should receive attention from HIV research and clinical care in low-resource settings.